Source-filter models for time-scale pitch-scale modification of speech

نویسنده

  • Alex Acero
چکیده

This paper presents two time-scale pitch-scale modification techniques to be used in speech synthesis systems. They have been applied to Microsoft’s Whistler system, which is based on concatenative synthesis. Both methods are based on a sourcefilter model, one of them using LPC parameters and the other one using cepstral parameters. The proposed methods achieve high quality prosody modification, retain the characteristics of the donor speaker, allow for spectral manipulation (to reduce spectral discontinuities at unit boundaries), yield compact acoustic inventories and improved voiced fricatives.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Voice source analysis for pitch-scale modification of speech signals

Much research has shown that the voice source has strong influence on the quality of speech processing [4][5][6]. But in most of the existing speech modification algorithms, the effect of the voice source variation is neglected. This work explains why the existing modification scheme can’t truly reflect the voice source variation during pitch modification. We use synthesized voiced speech sound...

متن کامل

A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech

This paper presents a time-scale pitch-scale modification technique for concatenative speech synthesis. The method is based on a frequency domain source-filter model, where the source is modeled as a mixed excitation. This model is highly coupled with a compression scheme that result in compact acoustic inventories. When compared to the approach in the Whistler system using no mixed excitation,...

متن کامل

Aspiration Noise during Phonation: Synthesis, Analysis, and Pitch-scale Modification

The current study investigates the synthesis and analysis of aspiration noise in synthesized and spoken vowels. Based on the linear source-filter model of speech production, we implement a vowel synthesizer in which the aspiration noise source is temporally modulated by the periodic source waveform. Modulations in the noise source waveform and their synchrony with the periodic source are shown ...

متن کامل

Effects of Pitch Contours Stylization and Time Scale Modification on Natural Speech Synthesis

This paper describes the method of generation of intonated speech for natural speech synthesis using prosody generation model. The effect of pitch modification through pitch contour stylization for parameter extraction and time scale modification for it’s implementation has been mentioned. An approach for close-copy syllabic stylization has been described. In the latter part, algorithm for impl...

متن کامل

Real-time Pitch-shifting of Musical Signals by a Time-varying Factor Using Normalized Filtered Correlation Time-scale Modification (nfc-tsm)

This paper presents a high-quality real-time pitch-shifting algorithm with a time-varying factor for monophonic audio and musical signals. The pitch-shifting algorithm is based on the resampling and time-scale modification method. A new time-scale modification method has been developed which is called the Normalized Filtered Correlation Time-Scale Modification (NFC-TSM) method. It uses a ring b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998